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COMMENTS  ON  "A  SIMPLE  PROOF  OF  THE  SEPARATION 
THEOREM  FOR  LINEAR  STOCHASTIC 
SYSTEMS  WITH  TIME  DELAYS" 


Anders  Lindquist 


The  above  mentioned  proof*  contains  a subtle  circular  argument,  which  is 

2 

rather  common  in  the  literature;  it  can,  for  example,  also  be  found  in  [3]  . A 
few  comments  are  therefore  in  order. 

The  authors  consider  a linear  stochastic  system  with  time  delays  having 
state  process  x,  input  (control)  process  u and  output  process  z.  (The  fact 
that  there  are  time  delays  in  the  system  is  not  important  here.)  The  problem 
is  to  devise  a nonrandom  feedback  loop 


u(t)  = Tr(t,z(s);  0 < s < t) 


so  that  the  resulting  feedback  equations  have  a unique  solution  and  a quadratic 
criterion  is  minimized.  To  emphasize  the  dependence  of  x and  z on  the  con- 
trol process  u,  we  shall  here  write  xu  and  z^.  Using  the  transition  matrix, 
the  state  equation  can  be  integrated  to  attain  the  form 


xu(t)  = xo(t)  + 


K(t  ,s)u 

J0 


(s)  ds 
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where  K is  a matrix  kernel  and  the  index  o indicates  that  no  control  is 
applied.  Given  any  two  controls  u and  v,  define 

x - (s|t)  = E(x  (s)  I z (t) ; 0 < r s t). 
u I v u ' v 

The  proof*  is  based  on  a simple  conpletion-of-the-square  argument  adapted 
from  [3]  which  requires  that,  for  all  s s t,  the  estimation  error 

x , (sit)  = x (s)  - x . (sit) 
u|u  uv  u|uv  ' 

is  independent  of  the  choice  of  control  signal  u.  The  proof  of  this  fact, 
however,  is  based  on  the  assumption  that 

xu|u(slt)  = x0j0(slt)  + | K(s,t)u(T)  dT 

for  s s t (where  ^0|0  *s  t*lc  est^mate  obtained  when  there  is  no  control), 
which  in  turn  is  equivalent  to  xu|u  being  independent  of  the  control,  thus 
closing  the  circle.  In  fact,  a priori  we  know  only  that 

xu|i,(s|t)  = x0|u(s|t)  + J K(s,t)u(t)  dr, 

and  to  show  that  ^o|u  = ^o|o  we  muSt:  ^rst  prove  that  the  sigma-fields 
generated  by  (z  (t);  0 < t s t)  for  each  t are  independent  of  the  choice  of 
control  law. 

This  is  by  no  means  a trivial  question.  The  control  dependence  can  be 
eliminated  by  applying  the  Girsanov  transformation,  but  this  implies  that  the 
solution  of  the  resulting  system  equations  exists  only  in  a weak  sense,  and  con- 
sequently some  of  the  physical  meaning  of  the  feedback  problem  is  lost.  Hence 
we  shall  only  consider  strong  solutions.  As  explained  in  [10],  this  requires 
some  care  in  defining  the  class  of  admissible  control  laws. 

There  are  several  ways  to  modify  the  note*  so  that  the  proof  becomes  correct. 
The  simplest  is  to  allow  only  linear  feedback  laws,  i.e.,  it  belonging  to  the 
class  L defined  by 


7T(t,z)  = f(t) 
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[ F(t,s)  dzCs), 

J0 

where  f is  a vector  function  and  F a matrix  function,  both  square-integrable . 
It  is  not  hard  to  prove  [101  that  the  sigma-fields  {Z^}  are  constant  in  this 
case . 

Although  the  candidate  ir*  for  optimal  control  law  (see  equ.  4 in  the  note*) 
belongs  to  L,  we  usually  want  to  know  how  it  compares  with  nonlinear  feedback 
controls.  As  in  [11],  to  insure  the  sigma- field  constancy,  we  can  require  that 
it  be  a Lipschitz  continuous  function  of  the  sample  functions  of  z,  but  we  must 
realize  that  L is  not  contained  in  this  class  of  control  laws,  the  controls  of 
L not  being  defined  samplewise  unless  the  functions  s H-  F(t,s)  have  bounded 
variation.  Hence  we  must  impose  some  technical  assumptions  to  insure  that  it* 
is  admissible. 

A more  exhaustive  class  of  nonlinear  feedback  controls  can  be  defined  by 
first  observing  that  the  sigma-field  constancy  problem  does  not  occur  when  there 
is  a positive  delay  in  the  feedback  loop,  i.e.,  the  control  law  tt  is  of  the 
form 

u(t)  = TT(t,z(s) ; 0 < s ^ t - e) 

where  e > 0,  and  then  noting  that  properly  defined  limits  of  such  rr  will  also 
do.  (The  limit  in  probability  of  a sequence  of  random  vectors  will  retain  the 
measurability  property  of  the  sequence.) 

Finally,  in  the  proof*  it  is  necessary  to  assume  that  the  system  is  Gaussian, 
i.e.,  that  the  driving  noises  and  the  initial  conditions  are  (jointly)  Gaussian; 
otherwise  the  estimates  Xq|0  will  not  be  linear  in  the  data  as  required.  If 
all  conditional  expectations  are  replaced  by  wide  sense  conditional  expectations, 
the  Gaussian  assumption  can  be  dispensed  with  provided  that  the  analysis  is 
restricted  to  the  linear  class  L;  this  will  not  work  for  nonlinear  control  laws. 
Removing  the  Gaussian  condition  in  the  general  case  will  lead  to  nonlinear  fil- 
tering. Formally  the  optimal  solution  will  be  the  same  function  of  the  state 


